Dataset statistics
| Number of variables | 16 |
|---|---|
| Number of observations | 7907 |
| Missing cells | 5518 |
| Missing cells (%) | 4.4% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 988.5 KiB |
| Average record size in memory | 128.0 B |
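The missing-cell percentage can be cross-checked from the counts in this table and the per-variable missing counts reported in this document. A minimal sketch, using only figures from the report (the variable names are illustrative):

```python
# Figures copied from the report; the variable names are illustrative.
n_variables = 16
n_observations = 7907

# Per-variable missing counts as reported in the variable sections.
missing_by_variable = {"name": 2, "last_review": 2758, "reviews_per_month": 2758}

total_cells = n_variables * n_observations
missing_cells = sum(missing_by_variable.values())
missing_pct = 100 * missing_cells / total_cells

print(total_cells)            # 126512
print(missing_cells)          # 5518
print(f"{missing_pct:.1f}%")  # 4.4%
```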
Variable types
| NUM | 10 |
|---|---|
| CAT | 6 |
Warnings
| Warning | Type |
|---|---|
| name has a high cardinality: 7457 distinct values | High cardinality |
| host_name has a high cardinality: 1833 distinct values | High cardinality |
| last_review has a high cardinality: 1001 distinct values | High cardinality |
| neighbourhood is highly correlated with neighbourhood_group | High correlation |
| neighbourhood_group is highly correlated with neighbourhood | High correlation |
| last_review has 2758 (34.9%) missing values | Missing |
| reviews_per_month has 2758 (34.9%) missing values | Missing |
| name is uniformly distributed | Uniform |
| id has unique values | Unique |
| number_of_reviews has 2758 (34.9%) zeros | Zeros |
| availability_365 has 1386 (17.5%) zeros | Zeros |
Reproduction
| Analysis started | 2020-10-14 15:03:48.629469 |
|---|---|
| Analysis finished | 2020-10-14 15:07:19.217779 |
| Duration | 3 minutes and 30.59 seconds |
| Software version | pandas-profiling v2.9.0 |
| Download configuration | config.yaml |
id
Real number (ℝ≥0)
| Distinct | 7907 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 23388624.63 |
|---|---|
| Minimum | 49091 |
| Maximum | 38112762 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 61.8 KiB |
Quantile statistics
| Minimum | 49091 |
|---|---|
| 5-th percentile | 5219982.7 |
| Q1 | 15821800.5 |
| median | 24706270 |
| Q3 | 32348500 |
| 95-th percentile | 37030477.3 |
| Maximum | 38112762 |
| Range | 38063671 |
| Interquartile range (IQR) | 16526699.5 |
Descriptive statistics
| Standard deviation | 10164162.07 |
|---|---|
| Coefficient of variation (CV) | 0.4345771599 |
| Kurtosis | -0.9351301902 |
| Mean | 23388624.63 |
| Median Absolute Deviation (MAD) | 8146858 |
| Skewness | -0.4293241662 |
| Sum | 1.849338549e+11 |
| Variance | 1.033101905e+14 |
| Monotonicity | Strictly increasing |
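Several of the derived statistics can be recomputed from the other reported values. A minimal sketch, assuming the usual definitions (CV = standard deviation / mean, range = maximum - minimum, IQR = Q3 - Q1, variance = standard deviation squared) and using the id figures from the tables above:

```python
# Values copied from the id variable's tables above.
mean, std = 23388624.63, 10164162.07
minimum, maximum = 49091, 38112762
q1, q3 = 15821800.5, 32348500

cv = std / mean                  # coefficient of variation
value_range = maximum - minimum  # range
iqr = q3 - q1                    # interquartile range
variance = std ** 2              # variance from the standard deviation

print(round(cv, 4))   # 0.4346
print(value_range)    # 38063671
print(iqr)            # 16526699.5
print(f"{variance:.4e}")
```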
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 22890495 | 1 | < 0.1% | |
| 18886111 | 1 | < 0.1% | |
| 50646 | 1 | < 0.1% | |
| 30721506 | 1 | < 0.1% | |
| 16743909 | 1 | < 0.1% | |
| 23604711 | 1 | < 0.1% | |
| 19797480 | 1 | < 0.1% | |
| 24211284 | 1 | < 0.1% | |
| 5975052 | 1 | < 0.1% | |
| 21671407 | 1 | < 0.1% | |
| Other values (7897) | 7897 | 99.9% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 49091 | 1 | < 0.1% | |
| 50646 | 1 | < 0.1% | |
| 56334 | 1 | < 0.1% | |
| 71609 | 1 | < 0.1% | |
| 71896 | 1 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 38112762 | 1 | < 0.1% | |
| 38110493 | 1 | < 0.1% | |
| 38109336 | 1 | < 0.1% | |
| 38108273 | 1 | < 0.1% | |
| 38105126 | 1 | < 0.1% |
name
Categorical
| Distinct | 7457 |
|---|---|
| Distinct (%) | 94.3% |
| Missing | 2 |
| Missing (%) | < 0.1% |
| Memory size | 61.8 KiB |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| Luxury hostel with in-cabin locker - Single mixed | 13 | 0.2% | |
| Studio Apartment - Oakwood Premier | 9 | 0.1% | |
| Inviting & Cozy 1BR APT 3 mins from Tg Pagar MRT | 9 | 0.1% | |
| Tasteful & Cozy 1 BR near SGH/Tiong Bahru | 8 | 0.1% | |
| Superhost 1BR APT in the heart of Tg Pagar | 8 | 0.1% | |
| Stylish 1BR Located 7 mins from Tg Pagar MRT | 8 | 0.1% | |
| City-located 1BR loft apartment *BRAND NEW* | 8 | 0.1% | |
| Furnished, Homely 2BR APT near Bouna Vista MRT | 7 | 0.1% | |
| City-located studio loft apartment *BRAND NEW* | 7 | 0.1% | |
| Single Capsule For 1 (Free Breakfast) | 7 | 0.1% | |
| Other values (7447) | 7821 | 98.9% |
Unique
| Unique | 7192 |
|---|---|
| Unique (%) | 91.0% |
Length
| Max length | 99 |
|---|---|
| Median length | 40 |
| Mean length | 37.99822942 |
| Min length | 1 |
host_id
Real number (ℝ≥0)
| Distinct | 2705 |
|---|---|
| Distinct (%) | 34.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 91144807.41 |
|---|---|
| Minimum | 23666 |
| Maximum | 288567551 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 61.8 KiB |
Quantile statistics
| Minimum | 23666 |
|---|---|
| 5-th percentile | 3356540.3 |
| Q1 | 23058075 |
| median | 63448912 |
| Q3 | 155381142 |
| 95-th percentile | 248196938 |
| Maximum | 288567551 |
| Range | 288543885 |
| Interquartile range (IQR) | 132323067 |
Descriptive statistics
| Standard deviation | 81909095.31 |
|---|---|
| Coefficient of variation (CV) | 0.8986699038 |
| Kurtosis | -0.7718327787 |
| Mean | 91144807.41 |
| Median Absolute Deviation (MAD) | 52147950 |
| Skewness | 0.7373540654 |
| Sum | 7.206819922e+11 |
| Variance | 6.709099894e+15 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 66406177 | 274 | 3.5% | |
| 8492007 | 203 | 2.6% | |
| 209913841 | 157 | 2.0% | |
| 29420853 | 141 | 1.8% | |
| 31464513 | 114 | 1.4% | |
| 219550151 | 113 | 1.4% | |
| 2413412 | 112 | 1.4% | |
| 108773366 | 109 | 1.4% | |
| 23722617 | 84 | 1.1% | |
| 8948251 | 83 | 1.0% | |
| Other values (2695) | 6517 | 82.4% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 23666 | 1 | < 0.1% | |
| 59498 | 3 | < 0.1% | |
| 165209 | 2 | < 0.1% | |
| 184596 | 1 | < 0.1% | |
| 227796 | 1 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 288567551 | 1 | < 0.1% | |
| 288546201 | 1 | < 0.1% | |
| 288249975 | 1 | < 0.1% | |
| 288110467 | 1 | < 0.1% | |
| 288016519 | 2 | < 0.1% |
host_name
Categorical
| Distinct | 1833 |
|---|---|
| Distinct (%) | 23.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 61.8 KiB |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| Jay | 290 | 3.7% | |
| Alvin | 249 | 3.1% | |
| Richards | 157 | 2.0% | |
| Aaron | 145 | 1.8% | |
| Rain | 115 | 1.5% | |
| Darcy | 114 | 1.4% | |
| Kaurus | 112 | 1.4% | |
| RedDoorz | 109 | 1.4% | |
| Alex | 105 | 1.3% | |
| Joey | 94 | 1.2% | |
| Other values (1823) | 6417 | 81.2% |
Unique
| Unique | 1092 |
|---|---|
| Unique (%) | 13.8% |
Length
| Max length | 35 |
|---|---|
| Median length | 5 |
| Mean length | 5.890603263 |
| Min length | 1 |
neighbourhood_group
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 61.8 KiB |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| Central Region | 6309 | 79.8% | |
| West Region | 540 | 6.8% | |
| East Region | 508 | 6.4% | |
| North-East Region | 346 | 4.4% | |
| North Region | 204 | 2.6% |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 17 |
|---|---|
| Median length | 14 |
| Mean length | 13.68205388 |
| Min length | 11 |
neighbourhood
Categorical
| Distinct | 43 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 61.8 KiB |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| Kallang | 1043 | 13.2% | |
| Geylang | 994 | 12.6% | |
| Novena | 537 | 6.8% | |
| Rochor | 536 | 6.8% | |
| Outram | 477 | 6.0% | |
| Bukit Merah | 470 | 5.9% | |
| Downtown Core | 428 | 5.4% | |
| Bedok | 373 | 4.7% | |
| River Valley | 362 | 4.6% | |
| Queenstown | 266 | 3.4% | |
| Other values (33) | 2421 | 30.6% |
Unique
| Unique | 3 |
|---|---|
| Unique (%) | < 0.1% |
Length
| Max length | 23 |
|---|---|
| Median length | 7 |
| Mean length | 8.419501707 |
| Min length | 4 |
latitude
Real number (ℝ≥0)
| Distinct | 4885 |
|---|---|
| Distinct (%) | 61.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.314192465 |
|---|---|
| Minimum | 1.24387 |
| Maximum | 1.45459 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 61.8 KiB |
Quantile statistics
| Minimum | 1.24387 |
|---|---|
| 5-th percentile | 1.277909 |
| Q1 | 1.295795 |
| median | 1.31103 |
| Q3 | 1.32211 |
| 95-th percentile | 1.377771 |
| Maximum | 1.45459 |
| Range | 0.21072 |
| Interquartile range (IQR) | 0.026315 |
Descriptive statistics
| Standard deviation | 0.03057744427 |
|---|---|
| Coefficient of variation (CV) | 0.02326709754 |
| Kurtosis | 4.139245158 |
| Mean | 1.314192465 |
| Median Absolute Deviation (MAD) | 0.0133 |
| Skewness | 1.722879931 |
| Sum | 10391.31982 |
| Variance | 0.0009349800983 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 1.31141 | 9 | 0.1% | |
| 1.31125 | 8 | 0.1% | |
| 1.31137 | 8 | 0.1% | |
| 1.31403 | 7 | 0.1% | |
| 1.28376 | 7 | 0.1% | |
| 1.31163 | 7 | 0.1% | |
| 1.31102 | 6 | 0.1% | |
| 1.31565 | 6 | 0.1% | |
| 1.31244 | 6 | 0.1% | |
| 1.31523 | 6 | 0.1% | |
| Other values (4875) | 7837 | 99.1% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1.24387 | 1 | < 0.1% | |
| 1.24391 | 1 | < 0.1% | |
| 1.24526 | 1 | < 0.1% | |
| 1.24627 | 1 | < 0.1% | |
| 1.24847 | 1 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1.45459 | 1 | < 0.1% | |
| 1.45328 | 1 | < 0.1% | |
| 1.45301 | 1 | < 0.1% | |
| 1.45265 | 1 | < 0.1% | |
| 1.45203 | 1 | < 0.1% |
longitude
Real number (ℝ≥0)
| Distinct | 5414 |
|---|---|
| Distinct (%) | 68.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 103.8487875 |
|---|---|
| Minimum | 103.64656 |
| Maximum | 103.97342 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 61.8 KiB |
Quantile statistics
| Minimum | 103.64656 |
|---|---|
| 5-th percentile | 103.759509 |
| Q1 | 103.835825 |
| median | 103.84941 |
| Q3 | 103.872535 |
| 95-th percentile | 103.912734 |
| Maximum | 103.97342 |
| Range | 0.32686 |
| Interquartile range (IQR) | 0.03671 |
Descriptive statistics
| Standard deviation | 0.04367464259 |
|---|---|
| Coefficient of variation (CV) | 0.0004205599666 |
| Kurtosis | 1.970240678 |
| Mean | 103.8487875 |
| Median Absolute Deviation (MAD) | 0.01537 |
| Skewness | -0.738700154 |
| Sum | 821132.3624 |
| Variance | 0.001907474405 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 103.86022 | 7 | 0.1% | |
| 103.85361 | 7 | 0.1% | |
| 103.84294 | 7 | 0.1% | |
| 103.84523 | 6 | 0.1% | |
| 103.83863 | 6 | 0.1% | |
| 103.85201 | 6 | 0.1% | |
| 103.84667 | 6 | 0.1% | |
| 103.84528 | 6 | 0.1% | |
| 103.84014 | 6 | 0.1% | |
| 103.85192 | 6 | 0.1% | |
| Other values (5404) | 7844 | 99.2% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 103.64656 | 1 | < 0.1% | |
| 103.66547 | 1 | < 0.1% | |
| 103.68162 | 1 | < 0.1% | |
| 103.6852 | 1 | < 0.1% | |
| 103.68536 | 1 | < 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 103.97342 | 1 | < 0.1% | |
| 103.97292 | 1 | < 0.1% | |
| 103.97171 | 1 | < 0.1% | |
| 103.97158 | 1 | < 0.1% | |
| 103.97105 | 1 | < 0.1% |
room_type
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 61.8 KiB |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| Entire home/apt | 4132 | 52.3% | |
| Private room | 3381 | 42.8% | |
| Shared room | 394 | 5.0% |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 15 |
|---|---|
| Median length | 15 |
| Mean length | 13.51789554 |
| Min length | 11 |
price
Real number (ℝ≥0)
| Distinct | 374 |
|---|---|
| Distinct (%) | 4.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 169.3329961 |
|---|---|
| Minimum | 0 |
| Maximum | 10000 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Memory size | 61.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 35 |
| Q1 | 65 |
| median | 124 |
| Q3 | 199 |
| 95-th percentile | 381 |
| Maximum | 10000 |
| Range | 10000 |
| Interquartile range (IQR) | 134 |
Descriptive statistics
| Standard deviation | 340.1875991 |
|---|---|
| Coefficient of variation (CV) | 2.008985886 |
| Kurtosis | 464.4327957 |
| Mean | 169.3329961 |
| Median Absolute Deviation (MAD) | 64 |
| Skewness | 19.09278291 |
| Sum | 1338916 |
| Variance | 115727.6026 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 60 | 218 | 2.8% | |
| 50 | 209 | 2.6% | |
| 100 | 189 | 2.4% | |
| 150 | 174 | 2.2% | |
| 131 | 171 | 2.2% | |
| 69 | 170 | 2.1% | |
| 200 | 166 | 2.1% | |
| 119 | 152 | 1.9% | |
| 56 | 152 | 1.9% | |
| 81 | 146 | 1.8% | |
| Other values (364) | 6160 | 77.9% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 1 | < 0.1% | |
| 14 | 4 | 0.1% | |
| 15 | 5 | 0.1% | |
| 18 | 4 | 0.1% | |
| 19 | 28 | 0.4% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 10000 | 3 | < 0.1% | |
| 8900 | 2 | < 0.1% | |
| 7000 | 2 | < 0.1% | |
| 6944 | 1 | < 0.1% | |
| 6000 | 1 | < 0.1% |
minimum_nights
Real number (ℝ≥0)
| Distinct | 73 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 17.51005438 |
|---|---|
| Minimum | 1 |
| Maximum | 1000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 61.8 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 3 |
| Q3 | 10 |
| 95-th percentile | 90 |
| Maximum | 1000 |
| Range | 999 |
| Interquartile range (IQR) | 9 |
Descriptive statistics
| Standard deviation | 42.09461647 |
|---|---|
| Coefficient of variation (CV) | 2.404025456 |
| Kurtosis | 69.89985985 |
| Mean | 17.51005438 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 6.102892196 |
| Sum | 138452 |
| Variance | 1771.956735 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 2089 | 26.4% | |
| 2 | 1388 | 17.6% | |
| 3 | 1133 | 14.3% | |
| 90 | 514 | 6.5% | |
| 7 | 424 | 5.4% | |
| 30 | 407 | 5.1% | |
| 5 | 401 | 5.1% | |
| 4 | 214 | 2.7% | |
| 6 | 194 | 2.5% | |
| 18 | 181 | 2.3% | |
| Other values (63) | 962 | 12.2% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 2089 | 26.4% | |
| 2 | 1388 | 17.6% | |
| 3 | 1133 | 14.3% | |
| 4 | 214 | 2.7% | |
| 5 | 401 | 5.1% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1000 | 1 | < 0.1% | |
| 700 | 1 | < 0.1% | |
| 500 | 1 | < 0.1% | |
| 365 | 30 | 0.4% | |
| 360 | 3 | < 0.1% |
number_of_reviews
Real number (ℝ≥0)
| Distinct | 208 |
|---|---|
| Distinct (%) | 2.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 12.80738586 |
|---|---|
| Minimum | 0 |
| Maximum | 323 |
| Zeros | 2758 |
| Zeros (%) | 34.9% |
| Memory size | 61.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 2 |
| Q3 | 10 |
| 95-th percentile | 66 |
| Maximum | 323 |
| Range | 323 |
| Interquartile range (IQR) | 10 |
Descriptive statistics
| Standard deviation | 29.70774597 |
|---|---|
| Coefficient of variation (CV) | 2.319579209 |
| Kurtosis | 25.41366328 |
| Mean | 12.80738586 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 4.42551345 |
| Sum | 101268 |
| Variance | 882.5501706 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 2758 | 34.9% | |
| 1 | 1084 | 13.7% | |
| 2 | 592 | 7.5% | |
| 3 | 373 | 4.7% | |
| 4 | 258 | 3.3% | |
| 5 | 218 | 2.8% | |
| 6 | 187 | 2.4% | |
| 7 | 142 | 1.8% | |
| 8 | 130 | 1.6% | |
| 9 | 117 | 1.5% | |
| Other values (198) | 2048 | 25.9% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 2758 | 34.9% | |
| 1 | 1084 | 13.7% | |
| 2 | 592 | 7.5% | |
| 3 | 373 | 4.7% | |
| 4 | 258 | 3.3% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 323 | 1 | < 0.1% | |
| 307 | 1 | < 0.1% | |
| 296 | 2 | < 0.1% | |
| 291 | 1 | < 0.1% | |
| 289 | 1 | < 0.1% |
last_review
Categorical
| Distinct | 1001 |
|---|---|
| Distinct (%) | 19.4% |
| Missing | 2758 |
| Missing (%) | 34.9% |
| Memory size | 61.8 KiB |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 2019-08-12 | 152 | 1.9% | |
| 2019-08-11 | 128 | 1.6% | |
| 2019-08-13 | 110 | 1.4% | |
| 2019-08-10 | 87 | 1.1% | |
| 2019-08-08 | 78 | 1.0% | |
| 2019-08-04 | 74 | 0.9% | |
| 2019-08-05 | 66 | 0.8% | |
| 2019-08-25 | 64 | 0.8% | |
| 2019-07-29 | 62 | 0.8% | |
| 2019-07-31 | 62 | 0.8% | |
| Other values (991) | 4266 | 54.0% | |
| (Missing) | 2758 | 34.9% |
Unique
| Unique | 423 |
|---|---|
| Unique (%) | 8.2% |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 7.558366005 |
| Min length | 3 |
reviews_per_month
Real number (ℝ≥0)
| Distinct | 527 |
|---|---|
| Distinct (%) | 10.2% |
| Missing | 2758 |
| Missing (%) | 34.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.043668674 |
|---|---|
| Minimum | 0.01 |
| Maximum | 13 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 61.8 KiB |
Quantile statistics
| Minimum | 0.01 |
|---|---|
| 5-th percentile | 0.05 |
| Q1 | 0.18 |
| median | 0.55 |
| Q3 | 1.37 |
| 95-th percentile | 3.8 |
| Maximum | 13 |
| Range | 12.99 |
| Interquartile range (IQR) | 1.19 |
Descriptive statistics
| Standard deviation | 1.285851237 |
|---|---|
| Coefficient of variation (CV) | 1.232049279 |
| Kurtosis | 8.021118779 |
| Mean | 1.043668674 |
| Median Absolute Deviation (MAD) | 0.44 |
| Skewness | 2.330718695 |
| Sum | 5373.85 |
| Variance | 1.653413403 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 172 | 2.2% | |
| 0.04 | 104 | 1.3% | |
| 0.08 | 96 | 1.2% | |
| 0.05 | 93 | 1.2% | |
| 0.1 | 92 | 1.2% | |
| 0.12 | 92 | 1.2% | |
| 0.06 | 91 | 1.2% | |
| 0.15 | 75 | 0.9% | |
| 0.16 | 75 | 0.9% | |
| 0.14 | 74 | 0.9% | |
| Other values (517) | 4185 | 52.9% | |
| (Missing) | 2758 | 34.9% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.01 | 3 | < 0.1% | |
| 0.02 | 61 | 0.8% | |
| 0.03 | 72 | 0.9% | |
| 0.04 | 104 | 1.3% | |
| 0.05 | 93 | 1.2% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 13 | 1 | < 0.1% | |
| 12.6 | 1 | < 0.1% | |
| 12 | 1 | < 0.1% | |
| 11.03 | 1 | < 0.1% | |
| 8.37 | 1 | < 0.1% |
calculated_host_listings_count
Real number (ℝ≥0)
| Distinct | 55 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 40.60768939 |
|---|---|
| Minimum | 1 |
| Maximum | 274 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 61.8 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 9 |
| Q3 | 48 |
| 95-th percentile | 203 |
| Maximum | 274 |
| Range | 273 |
| Interquartile range (IQR) | 46 |
Descriptive statistics
| Standard deviation | 65.13525309 |
|---|---|
| Coefficient of variation (CV) | 1.604012789 |
| Kurtosis | 4.166080549 |
| Mean | 40.60768939 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 2.149585925 |
| Sum | 321085 |
| Variance | 4242.601196 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 1965 | 24.9% | |
| 2 | 644 | 8.1% | |
| 3 | 339 | 4.3% | |
| 274 | 274 | 3.5% | |
| 4 | 216 | 2.7% | |
| 203 | 203 | 2.6% | |
| 67 | 201 | 2.5% | |
| 6 | 192 | 2.4% | |
| 7 | 189 | 2.4% | |
| 8 | 184 | 2.3% | |
| Other values (45) | 3500 | 44.3% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 1965 | 24.9% | |
| 2 | 644 | 8.1% | |
| 3 | 339 | 4.3% | |
| 4 | 216 | 2.7% | |
| 5 | 175 | 2.2% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 274 | 274 | 3.5% | |
| 203 | 203 | 2.6% | |
| 157 | 157 | 2.0% | |
| 141 | 141 | 1.8% | |
| 114 | 114 | 1.4% |
availability_365
Real number (ℝ≥0)
| Distinct | 359 |
|---|---|
| Distinct (%) | 4.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 208.7263185 |
|---|---|
| Minimum | 0 |
| Maximum | 365 |
| Zeros | 1386 |
| Zeros (%) | 17.5% |
| Memory size | 61.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 54 |
| median | 260 |
| Q3 | 355 |
| 95-th percentile | 365 |
| Maximum | 365 |
| Range | 365 |
| Interquartile range (IQR) | 301 |
Descriptive statistics
| Standard deviation | 146.1200345 |
|---|---|
| Coefficient of variation (CV) | 0.7000556306 |
| Kurtosis | -1.602890685 |
| Mean | 208.7263185 |
| Median Absolute Deviation (MAD) | 104 |
| Skewness | -0.3055947725 |
| Sum | 1650399 |
| Variance | 21351.06448 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 1386 | 17.5% | |
| 365 | 843 | 10.7% | |
| 364 | 336 | 4.2% | |
| 362 | 150 | 1.9% | |
| 358 | 131 | 1.7% | |
| 359 | 117 | 1.5% | |
| 363 | 116 | 1.5% | |
| 361 | 85 | 1.1% | |
| 356 | 80 | 1.0% | |
| 360 | 80 | 1.0% | |
| Other values (349) | 4583 | 58.0% |
Minimum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 1386 | 17.5% | |
| 1 | 15 | 0.2% | |
| 2 | 19 | 0.2% | |
| 3 | 16 | 0.2% | |
| 4 | 11 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) |
|---|---|---|
| 365 | 843 | 10.7% | |
| 364 | 336 | 4.2% | |
| 363 | 116 | 1.5% | |
| 362 | 150 | 1.9% | |
| 361 | 85 | 1.1% |
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
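The calculation described above can be sketched in plain Python. This is an illustration only, not the code the report itself uses; the function name `pearson_r` and the sample data are assumptions made for the example.

```python
import math

def pearson_r(xs, ys):
    """Covariance of X and Y divided by the product of their standard deviations."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    # The 1/n (or 1/(n-1)) factors cancel between numerator and denominator.
    return cov / math.sqrt(var_x * var_y)

# Illustrative data, not taken from the dataset.
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.1, 3.9, 6.2, 7.8, 10.1]

r = pearson_r(x, y)
# Shifting and positively rescaling either variable leaves r unchanged.
r_shifted = pearson_r([10 * v + 3 for v in x], [0.5 * v - 7 for v in y])
print(r, r_shifted)
```

Note that rescaling by a negative factor flips the sign of r, so the invariance holds for positive scale changes.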
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and +1 indicating total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
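As a rough sketch of that recipe, the example below ranks both variables (assuming tied values share the average of their ranks, a common convention) and then applies Pearson's formula to the ranks. The helper names and the sample data are illustrative assumptions.

```python
import math

def average_ranks(values):
    """Rank values from 1..n, giving tied values the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with values[order[i]].
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1  # average of the 1-based positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks

def pearson_r(xs, ys):
    """Covariance divided by the product of the standard deviations."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)

def spearman_rho(xs, ys):
    """Pearson's r applied to the rank variables."""
    return pearson_r(average_ranks(xs), average_ranks(ys))

# Illustrative data: y is a monotonic but nonlinear function of x.
x = [1, 2, 3, 4, 5, 6]
y = [v ** 3 for v in x]

rho = spearman_rho(x, y)  # perfect monotonic relationship
r = pearson_r(x, y)       # below 1 because the relationship is not linear
print(rho, r)
```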
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation and +1 indicating total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
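The pair-counting definition above corresponds to the variant often called tau-a. The sketch below implements exactly that definition; it does not adjust for ties (tie-adjusted variants such as tau-b exist), and the function name and sample data are illustrative assumptions.

```python
from itertools import combinations

def kendall_tau_a(xs, ys):
    """(concordant pairs - discordant pairs) / total number of pairs."""
    concordant = discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(xs, ys), 2):
        s = (x1 - x2) * (y1 - y2)
        if s > 0:
            concordant += 1   # both variables move in the same direction
        elif s < 0:
            discordant += 1   # the variables move in opposite directions
        # s == 0 means a tie in x or y; it counts toward neither group.
    n = len(xs)
    total_pairs = n * (n - 1) // 2
    return (concordant - discordant) / total_pairs

# Illustrative data, not taken from the dataset.
x = [1, 2, 3, 4, 5]
y = [3, 1, 2, 5, 4]

tau = kendall_tau_a(x, y)
print(tau)  # 0.4  (7 concordant pairs, 3 discordant pairs, 10 pairs in total)
```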
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. Extensive documentation is available from the Phik project.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure proposed by Bergsma in 2013.
First rows
| | id | name | host_id | host_name | neighbourhood_group | neighbourhood | latitude | longitude | room_type | price | minimum_nights | number_of_reviews | last_review | reviews_per_month | calculated_host_listings_count | availability_365 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 49091 | COZICOMFORT LONG TERM STAY ROOM 2 | 266763 | Francesca | North Region | Woodlands | 1.44255 | 103.79580 | Private room | 83 | 180 | 1 | 2013-10-21 | 0.01 | 2 | 365 |
| 1 | 50646 | Pleasant Room along Bukit Timah | 227796 | Sujatha | Central Region | Bukit Timah | 1.33235 | 103.78521 | Private room | 81 | 90 | 18 | 2014-12-26 | 0.28 | 1 | 365 |
| 2 | 56334 | COZICOMFORT | 266763 | Francesca | North Region | Woodlands | 1.44246 | 103.79667 | Private room | 69 | 6 | 20 | 2015-10-01 | 0.20 | 2 | 365 |
| 3 | 71609 | Ensuite Room (Room 1 & 2) near EXPO | 367042 | Belinda | East Region | Tampines | 1.34541 | 103.95712 | Private room | 206 | 1 | 14 | 2019-08-11 | 0.15 | 9 | 353 |
| 4 | 71896 | B&B Room 1 near Airport & EXPO | 367042 | Belinda | East Region | Tampines | 1.34567 | 103.95963 | Private room | 94 | 1 | 22 | 2019-07-28 | 0.22 | 9 | 355 |
| 5 | 71903 | Room 2-near Airport & EXPO | 367042 | Belinda | East Region | Tampines | 1.34702 | 103.96103 | Private room | 104 | 1 | 39 | 2019-08-15 | 0.38 | 9 | 346 |
| 6 | 71907 | 3rd level Jumbo room 5 near EXPO | 367042 | Belinda | East Region | Tampines | 1.34348 | 103.96337 | Private room | 208 | 1 | 25 | 2019-07-25 | 0.25 | 9 | 172 |
| 7 | 241503 | Long stay at The Breezy East "Leopard" | 1017645 | Bianca | East Region | Bedok | 1.32304 | 103.91363 | Private room | 50 | 90 | 174 | 2019-05-31 | 1.88 | 4 | 59 |
| 8 | 241508 | Long stay at The Breezy East "Plumeria" | 1017645 | Bianca | East Region | Bedok | 1.32458 | 103.91163 | Private room | 54 | 90 | 198 | 2019-04-28 | 2.08 | 4 | 133 |
| 9 | 241510 | Long stay at The Breezy East "Red Palm" | 1017645 | Bianca | East Region | Bedok | 1.32461 | 103.91191 | Private room | 42 | 90 | 236 | 2019-07-31 | 2.53 | 4 | 147 |
Last rows
| | id | name | host_id | host_name | neighbourhood_group | neighbourhood | latitude | longitude | room_type | price | minimum_nights | number_of_reviews | last_review | reviews_per_month | calculated_host_listings_count | availability_365 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 7897 | 38092051 | New Small Room @Orchard/Somerset/Central Area | 262337792 | Herman | Central Region | River Valley | 1.29482 | 103.83809 | Private room | 40 | 9 | 0 | NaN | NaN | 11 | 258 |
| 7898 | 38092142 | Well connected 2 bedroom 2 bathroom apartment ! | 223622603 | Tanya | Central Region | Rochor | 1.30082 | 103.84956 | Entire home/apt | 200 | 2 | 0 | NaN | NaN | 4 | 347 |
| 7899 | 38094671 | SMALL ROOM FOR ONE @SOMERSET/ORCHARD/CENTRAL AREA | 262337792 | Herman | Central Region | River Valley | 1.29369 | 103.83768 | Private room | 33 | 7 | 0 | NaN | NaN | 11 | 359 |
| 7900 | 38102097 | 环境优雅的公寓 | 286260560 | Bo | West Region | Bukit Batok | 1.35654 | 103.76028 | Private room | 90 | 3 | 0 | NaN | NaN | 1 | 83 |
| 7901 | 38104971 | 2 PAX LOFT Close To Kent Ridge Park | 278109833 | Belle | Central Region | Queenstown | 1.27973 | 103.78751 | Entire home/apt | 100 | 3 | 0 | NaN | NaN | 31 | 61 |
| 7902 | 38105126 | Loft 2 pax near Haw Par / Pasir Panjang. Free Wifi | 278109833 | Belle | Central Region | Queenstown | 1.27973 | 103.78751 | Entire home/apt | 100 | 3 | 0 | NaN | NaN | 31 | 61 |
| 7903 | 38108273 | 3bedroom luxury at Orchard | 238891646 | Neha | Central Region | Tanglin | 1.29269 | 103.82623 | Entire home/apt | 550 | 6 | 0 | NaN | NaN | 34 | 365 |
| 7904 | 38109336 | [ Farrer Park ] New City Fringe CBD Mins to MRT | 281448565 | Mindy | Central Region | Kallang | 1.31286 | 103.85996 | Private room | 58 | 30 | 0 | NaN | NaN | 3 | 173 |
| 7905 | 38110493 | Cheap Master Room in Central of Singapore | 243835202 | Huang | Central Region | River Valley | 1.29543 | 103.83801 | Private room | 56 | 14 | 0 | NaN | NaN | 2 | 30 |
| 7906 | 38112762 | Amazing room with private bathroom walk to Orchard | 28788520 | Terence | Central Region | River Valley | 1.29672 | 103.83325 | Private room | 65 | 90 | 0 | NaN | NaN | 7 | 365 |